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Motivated by potential applications to mobile radio, we studied variable- 
bit-rate speech communication through Gaussian-noise and Rayleigh-fading 
channels. For convenience we used a constant signaling rate of 32 kb/s and 
adjusted the source-coding and channel-coding rates in response to changing 
transmission quality. When the channel quality was good enough, we used all 
32 kb/s for speech transmission. When the channel quality was lower, we 
reduced the source rate to 24 or 16 kb/s and introduced channel coding to 
control distortion due to transmission errors. We concentrated on specific 
source and channel codes that could be implemented with hardware of modest 
complexity. The source code was embedded differential pulse code modulation, 
which is amenable to variable-bit-rate operation and economical to implement. 
For error control we introduced punctured convolutional codes and a Viterbi 
decoder with only 16 states. Although the source/channel codec was simple, it 
offered good performance. Speech quality was at the level of normal telephony 
when the channel was good; the error-correcting codes extended by up to 13 
dB the range of channel signal-to-noise ratios that support adequate quality. 
Our performance estimates were based on a new analysis of transmission 
errors in embedded differential pulse code modulation and on computer 
simulations of speech transmission through fixed and fading channels. 

I. INTRODUCTION 
1.1 Motivation 

Every communication channel has an optimum rate for digital 
transmission of speech. If the actual rate is below the optimum, speech 
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quality is impaired by unnecessary quantizing distortion. If the rate is 
higher than optimum, the distortion due to transmission errors is 
excessive. These observations are embodied in rate-distortion theory, 
which provides theoretical bounds on performance. Implicitly, they 
influence practical systems in which design decisions are compromises 
between goals of quality, bandwidth efficiency, and equipment econ- 
omy. 

When the properties of the communication channel are predictable 
and invariant, one may explore this compromise and arrive at an 
acceptable design. However, in many instances (for example, in 
switched telephony) prior knowledge of the channel is only statistical, 
and in others (fading microwaves) the channel changes drastically 
with time. With these channels, performance is characterized statis- 
tically by the fraction of users experiencing each possible level of 
quality or by the fraction of time a single user experiences each quality 
level. 

Both spatial and temporal channel fluctuations are inherent in 
mobile telephony. Because channel characteristics depend on the 
position of the mobile unit, all users have, at any time, different 
channels and, because mobile units are moving, their channels change 
with time. The conventional approach to this statistical nature of the 
channel is to set design goals of acceptable quality for a large fraction 
(say 90 percent) of users or to set "outage" limits, i.e., small fractions 
of time (say 10 percent) when quality is allowed to fall below a certain 
specification. In meeting these statistical requirements with a fixed 
coding and modulation scheme, we have a system in which perform- 
ance is worse than necessary for most users most of the time. The 
common deficiency is too much quantizing noise, because 90 percent 
of the channels are better than the marginal one for which the system 
is designed. The other 10 percent of the users have too much distortion 
because of transmission errors. 

In this paper we investigate variable-bit-rate transmission, with the 
information rate adjusted according to the properties of the channel. 
To facilitate implementation, we maintain a constant signaling rate 
through the channel and reciprocally adjust the information rate of 
the source and the amount of channel coding for forward error correc- 
tion. The result, relative to fixed-rate transmission, is improved grade 
of service or, for the same grade of service, the ability to serve more 
users within each geographical area. 1 

We limited the content of this paper to transmission components 
of a variable-bit-rate system: the source and channel codecs and their 
performance in two types of channel. Control components, which 
involve measurements of channel quality and two-way communication 
between user pairs, require further study. 
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1.2 Source and channel codes 

The most general variable-bit-rate scheme requires control infor- 
mation, an adjustable voice coder/decoder (codec), an adjustable chan- 
nel codec that provides forward error correction to combat channel 
impairments, and perhaps an adjustable modem (modulator/demodu- 
lator), as indicated in Fig. la. In our studies, we have focused on 
specific source and channel codes that are simple to implement and 
are especially well suited to variable-bit-rate operation. We do not 
claim that these codes are optimum, even within a constraint of limited 
hardware complexity. However, they are strong candidates for practi- 
cal implementation and this study of their performance provides 
valuable insights into general principles of variable-bit-rate transmis- 
sion. 

The source codes are embedded Differential Pulse Code Modulation 
(DPCM), 2 which can be implemented with fixed analog-to-digital and 
digital-to-analog converters and simple digital control of the number 
of bits transmitted. The channel codes are convolutional codes, with 
rates 1/2, 2/3, and 3/4, that provide significant coding gains with 
maximum likelihood decoding. At rates 2/3 and 3/4, "punctured" code 3 
realizations simplify the decoder and lend themselves to variable-bit- 
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Fig. 1— Elements of a variable-bit-rate system, (a) Adjustable source codec, channel 
codec, and modem, (b) Fixed embedded source codec and modem. The control of the 
source rate is simple. 
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rate operation because the encoder and decoder structures are the 
same as for the rate 1/2 code. In fact, the punctured codes are rate 
1/2 codes with a fraction (1 out of 4 or 2 out of 6) of the channel bits 
deleted. 

In the combined source/channel codec the channel signaling rate is 
constant; when the channel deteriorates, the least significant speech 
bits are deleted and the most significant speech bits are protected by 
a convolutional code. Thus the variable-rate encoding and decoding 
can be performed, as indicated in Fig. lb, with a fixed speech codec, a 
fixed modem, and modest additional hardware relative to fixed-rate 
operation. Furthermore, the performance characteristics of the em- 
bedded source code and the punctured channel code are virtually as 
good as their conventional counterparts, which require complicated 
adjustments in a variable-bit-rate environment. 

To explore the principle of combining embedded DPCM and punc- 
tured convolutional codes we have studied channels with signaling 
rates of 32 kb/s. We consider four speech transmission formats: 

1. All 32 kb/s used for speech transmission 

2. 24 kb/s speech transmission, all speech bits protected by a rate 
3/4 code 

3. 24 kb/s speech transmission, the most significant 2 bits of every 
3-bit code word protected by a rate 1/2 code 

4. 16 kb/s speech transmission, all speech bits protected by a rate 
1/2 code. 

These transmission formats, listed in order of increasing resistance to 
channel impairments, are summarized in Table I, which also presents 
properties of the convolutional codes discussed in Section HI. 



Table I— Source and channel code formats, convolutional code 
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1.3 An example of performance 

Figure 2 pertains to phase-shift-keying modulation in a Rayleigh- 
fading channel. The curves show speech quality, expressed as segmen- 
tal signal-to-noise ratio (s/n) (Section 2.4.2), as a function of channel 
s/n. This is defined as the ratio of average energy per channel symbol 
to noise power per Hertz. Format 1 (32 kb/s speech transmission) is 
the best choice when the channel is very good (s/n > 22 dB), but it is 
also the most vulnerable to transmission impairments. By contrast 
Format 4 (16 kb/s speech), with substantially more quantizing distor- 
tion, can pass through very poor channels with no added degradation. 
Adaptive DPCM at 16 kb/s is highly intelligible, though somewhat 
fuzzy. Assuming that the threshold of "adequate" performance is 0.5 
dB less than the s/n of 16 kb/s error-free transmission, we see in Fig. 
2 that Format 4 extends the useful range of channel quality by 13 dB 
relative to Format 1. 




-E 15- 



10 20 30 
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Fig. 2 — Performance of the four code formats as a function of the s/n of a Rayleigh- 
fading channel. If a segmental s/n of 9.5 dB represents "adequate" performance, the 
variable-rate mechanism extends the threshold of usable channel s/n's from 16 dB 
(Format 1) to 3 dB (Format 4). 
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1.4 Organization of this paper 

In the following three sections we describe the source and channel 
codes and present theoretical performance bounds and measures ob- 
tained from computer simulations of speech communication over 
idealized channels. The theory of the effects of transmission errors in 
embedded DPCM will be published in a separate paper, 4 which extends 
to embedded DPCM Sundberg and Rydbeck's analysis 5 " 7 of Pulse 
Code Modulation (PCM) s/n in noisy channels. Our approach to 
combined source coding and channel coding is similar in spirit to the 
work of Modestino and Daut on image transmission.*" 10 Section V 
lists several issues to be addressed in evaluating applications of these 
techniques to mobile radio communication. 

II. EMBEDDED DIFFERENTIAL PULSE CODE MODULATION 
2.1 Encoder and decoder 

Within the bit stream of an embedded code is a slower bit stream 
that represents the analog signal source with reasonable quality. 
Embedded coding simplifies variable-bit-rate communication. The 
analog-to-digital and the digital-to-analog converters always operate 
at the same high rate. If necessary, the transmission system deletes 
bits from the source code and adds filler bits prior to decoding. 
Although PCM is an embedded code, differential PCM, which is more 
efficient for speech transmission, is not. On the other hand, minor 
modifications of the conventional DPCM encoder and decoder do 
produce an embedded code. In the embedded codec, the integrators in 
the encoder and decoder process a low-bit- rate representation of the 
error signal. Then, when bits are deleted in transmission, the same 
signal arrives at both integrators and large errors are avoided. 

Figure 3 models the embedded DPCM encoder and decoder as the 
combination of two separate codecs: a "minimal" DPCM codec oper- 
ating with 2 bits/sample, and a "supplemental" PCM codec, which 
transmits the quantization error of the minimal encoder. The supple- 
mental codec operates at 2 bits/sample and the combination of mini- 
mal and supplemental quantizers can be viewed as a two-stage, suc- 
cessive-approximation realization of a 4-bit/sample quantizer. The 2 
bits/sample of the minimal codec are always transmitted, while one 
or both bits of the supplemental codec can be deleted to reduce the 
transmission rate from 32 kb/s (4 bits/sample) to 24 or 16 kb/s (3 or 
2 bits/sample). The codec functions properly because the encoder and 
decoder predictors operate on the same signal regardless of the number 
of bits deleted prior to transmission. 

Relative to conventional DPCM, the feedback loop of the embedded 
DPCM encoder operates with reduced resolution when more than 2 
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bits/sample are transmitted. Reference 2 shows that the impact on 
performance due to this reduced resolution is small, causing no more 
than a 0.7 dB decrease in s/n in the single-integration codec studied 
here. (On the other hand, conventional DPCM is unsuited to variable- 
bit-rate operation. If the transmission system deletes 1 or 2 bits/ 
sample there is a 5.5-dB s/n penalty.) 

2.2 Effects of transmission errors 

Reference 4 presents a detailed analysis of transmission errors in 
embedded DPCM. Specialized to a single-integration codec with pre- 
diction coefficient a, a key result of that analysis is the formula for 
the error in the fcth sample: 

x'(k) - x(k) = n q (k) + e A (k) + £ a'euik - i), (1) 

in which x(k) and x'(k) are the encoder input and decoder output, 
respectively. On the right side of (1), n q (k) is the quantizing distortion 
of the 2, 3, or 4 transmitted bits/sample, e A (k) is the noise due to 
binary errors in all of the transmitted bits, and e M (k) is the noise due 
to binary errors in the 2 bits/sample of the minimal quantizer. Equa- 
tion (1) shows that the errors in these two bits are amplified by the 
integrator in the decoder. 

2.3 Audio signal-to-noise ratio 

Our analytic tools 4 enable us to compute the mean-square value of 
(1) under the condition that the quantizer is in a granular state (output 
and input differ by no more than half a quantizing step). The derived 
signal-to-noise ratio, therefore, fails to reflect overload distortion. This 
apparent deficiency does not limit the value of the granular mean- 
square error as a predictor of speech quality. Subjective tests show 
that granular noise and overload are quite different perceptually and 
that adding their squares is often misleading. 

A similar issue is raised by the addition in (1) of a granular-noise 
term (n q ) to e A and eM, which result from transmission errors. Is the 
mean-square sum a meaningful measure of signal quality? We are 
optimistic that it is because, contrary to overload, which is signal- 
dependent, granular noise and noise due to transmission errors are 
essentially uncorrected with the input. With this rationale we have 
derived signal-to-noise ratio formulas of the form 4 

E[x(k)] 2 _ C 

S/n E[x'{k)-x(k)] 2 «i+«V 

in which C depends on the codec configuration and the source statis- 
tics, <Tq is the quantizing noise power, and o\ t is the distortion due to 
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transmission errors, including the effects of correlation between quan- 
tizing noise and noise due to transmission errors. 

With single-integration DPCM and a 2-bit minimal quantizer 

r I ~ ° 2 *<*/* (3) 

L ~ L 2 (l - 2ap + a 2 ) W 



and 



73, (4) 



where D (2, 3, or 4) is the number of bits transmitted, a is the predictor 
coefficient of the DPCM feedback loop, p is the adjacent-sample 
autocorrelation coefficient of the encoder input signal, x(k), and L is 
the quantizer load factor: the ratio of overload point to rms quantizer 
input. 

The other quantity in (2), a%, depends on the binary representation 
of quantizer output signals and on the transmission conditions includ- 
ing the modulator and demodulator, the channel, and the encoder and 
decoder (if they are included) for forward error correction. Table II 
contains the formulas for o%_ for the four transmission formats studied 
in this paper. The formulas pertain to a sign-magnitude binary rep- 
resentation and a quantizer with "3.16-sigma loading", i.e., the quan- 
tizer overload point is VTo times the rms value of the quantizer input. 

Table II — Formulas for computing audio signal- 
to-noise ratio 

General formula 

(l-q 2 L 2 /48) 1 

S/ " L 2 (l - 2a P + a 2 )V ql + 2' 2D /3) 
Coder parameters 
Predictor coefficient a = 0.85 

Quantizer load factor L = vlO 
Input signal 

Correlation coefficient p = 0.85 
Transmission effects, sign-magnitude representation 

P: binary error probability of unprotected bits 

P e : binary error probability of convolutional code 
Format 1: D = 4 (no channel coding) 

ff 2 qt = 0.723P + 0.271P 2 + ° 2 (0.725P + 0.275P 2 ) 

Format 2: D = 3 (rate 3/4 code) 

1 a 2 

a% = 0.25P. (3.37 + 2 1.73 

Format 3: D = 3 (rate 2/3 code) 



ff 2 , = 0.5P, ( 1.52 + 2 1.73 ) + 0.0625P 

Format 4: D = 2 (rate 1/2 code 



ff 2 t = P. 1 1.56 + ——j 1.73 
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2.4 Speech transmission 

2.4.1 Adaptive DPCM 

Practical DPCM codecs have time-varying step sizes to accommo- 
date the wide dynamic range of speech sounds. In this study, we have 
used a robust adaptive quantizer 11 in which the step size of the encoder 
changes with each sample according to the rule 

6 k+1 = Mbl (5) 

where 6 k is the step size used to encode the fcth sample, M is a 
multiplicative factor that depends on the most recent encoder output, 
and /? is a leakage factor that helps the codec work properly in the 
presence of transmission errors. 

At the decoder the step size is b' k with 

6' k+1 = M'(b' k Y, (6) 

where M ' depends on the received code word. It car. differ from M if 
there has been an error in transmitting the fcth code word. 

To make the adaptive quantizer work with embedded DPCM, we 
restrict M to one of two possible values: 

M = M x < 1 

if the input is in the lower half of the quantizer range, and 

M = M 2 > 1 

otherwise. Then with four-bit code words represented in a sign- 
magnitude format, M and M ' depend only on the most significant 
magnitude bit. As a consequence the decoder step size can track the 
encoder step size when one or both of the two least significant bits are 
deleted. 

In the simulation studies reported in this paper the adaptive quan- 
tizer constants were M x = 0.85, M 2 = 1.5, and /3 = 0.98. We selected 
these values because they offer a good compromise between perform- 
ance over an ideal channel and tolerance of transmission errors. 

2.4.2 Segmental signal-to-noise ratio 

In addition to the limitations discussed in Section 2.3, the s/n 
formulas of Table II are of limited value in evaluating adaptive DPCM 
coding of speech because they do not account for errors due to different 
step sizes at the transmitter and receiver. On the other hand, segmental 
s/n is a quality measure that is reasonably well correlated with listener 
reports of the quality of speech transmitted by adaptive DPCM over 
noisy channels. It is defined as the average of the s/n measured in 
decibels over short segments of the signal, and it can be calculated in 
a straightforward manner in computer simulations of speech trans- 
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mission. 12 In our simulations we measure s/n over 16-ms segments 
(128 samples) and exclude from the average segments detected as 
silent (power at least 55 dB below the saturation level of the quantizer). 
We also limit the s/n of each segment to the range —10 to 80 dB. 

III. CONVOLUTIONAL CODING AND DECODING, PSK MODULATION 

Our design, analysis, and implementation of the rate 3/4, 2/3, and 
1/2 coders and decoders for the most part follow Chapter 6 of Ref. 13. 
The punctured codes 3 (rate 1/2 codes with a fraction of the channel 
symbols deleted) are especially well suited to variable-bit-rate opera- 
tion because the encoder and decoder retain their basic structures as 
the code format changes; the things that do change are a set of 
constants (code generators) and the switching patterns that govern 
the output of channel symbols at the encoder and the input of channel 
symbols at the decoder. 

3. 1 Code configuration 

In our calculations and simulations, we use codes generated accord- 
ing to Fig. 4. With a five-stage shift register at the encoder, the codes 
have 4-bit memory (16 unique nodes in the code trellis). This number 
provides an attractive compromise between performance (enhanced 
by long memory) and decoder simplicity (short memory). When the 
rate 1/2 code is employed, the DPCM signal arrives at 16 kb/s, and 
for each input bit, the switch at the encoder gets one output bit from 
the top branch of Fig. 4 and one output bit from the bottom branch. 
When the rate is 2/3, the two most significant bits in each 3-bit DPCM 
code word stimulate three output bits from Fig. 4. The first of the two 
DPCM bits results in two output bits — one from the upper branch of 
Fig. 4 and one from the lower branch, as in the rate 1/2 code. When 
the second DPCM bit arrives, the encoder releases one bit from the 
upper branch. Along with these three bits from the convolutional 
encoder the least significant DPCM bit is transmitted without protec- 
tion. When the rate is 3/4, the first member of every block of three 
input bits results in two output bits (from upper and lower branch of 
Fig. 4); the remaining two input bits in the block stimulate one output 
bit each (both from the lower branch). The code generators and 
switching patterns are listed in Table I. 

The receiver is a maximum-likelihood decoder, implementing the 
Viterbi Algorithm. A signal processor produces a measurement (be- 
tween and 1) for each received bit. This measurement is near if 
there is a high likelihood that the bit was transmitted as zero and near 
1 if there is a high likelihood that one was transmitted. With each 
block of two (rate 1/2), three (rate 2/3), or four (rate 3/4) received 
bits, the Viterbi decoder accepts combined measurements of the first 
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two bits and updates the metrics and path memories associated with 
the 16 possible decoder states. The decoder then releases one bit 
corresponding to an input bit delayed by the length of the decoder 
path memory. (In the simulations reported here the path memory was 
30 bits for all codes.) If the rate is 2/3 or 3/4, the measurement of 
each remaining bit in the block causes the decoder to update all 
metrics and path memories and to release one additional bit. 

3.2 PSK modulation, likelihood measurements 

To study the performance of variable-bit-rate transmission with 
embedded DPCM and convolutional codes we have confined our 
attention to Binary Phase-Shift Keying (BPSK) modulation with 
coherent detection. In a Gaussian-noise channel the demodulator 
output for each channel symbol is either 

r = —A + n or r = A + n, 

depending on whether "zero" or "one" was transmitted. The constant, 
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A, is the received sinewave amplitude and the noise, n, is a sample of 
a zero-mean normal random variable with variance a 2 . The symbol 
s/n is p = A 2 /2a 2 . In our computer simulations, the likelihood meas- 
urement for each symbol is m = 0.5 + r/const, where the constant is 
large enough to ensure < m < 1. (With m effectively unquantized, 
as in our simulations, the constant is otherwise arbitrary. With m 
quantized to a few bits the constant is a compromise between dynamic 
range and resolution.) 

To update the state metrics of the Viterbi decoder we add morl- 
m, depending on whether the relevant branch of the code tree is 
associated with zero or one transmitted. For a constant-amplitude, 
white-Gaussian-noise channel, this procedure produces a precise like- 
lihood metric. In a fading channel, which is of principal interest in 
mobile radio, this metric is only an approximation to the likelihood 
function. 14 A true likelihood metric involves elaborate computations 
that would probably be prohibitively complicated in practice. 

3.3 Probability of error 

The theory of convolutional coding provides upper (union) bounds 
on bit-error probability as a function of s/n. These bounds are infinite 
series that are truncated for the purpose of numerical computation. In 
our calculations we have used sums of five terms to estimate the error 
probabilities of the convolutional codes: 

Pe = r I W n f n M, (7) 

n =d 

where d is the free distance of the code; b is the number of source bits 
per code word (1, 2, 3 for rate 1/2, 2/3, 3/4); and/„, c (p) is the probability 
that an incorrect path, n bits removed from the correct path, has a 
lower metric than the correct path. This probability is a function of 
the type of channel, (C = g for a Gaussian channel, C = R for Rayleigh 
fading), the modulation technique, and the s/n p. 

Table I contains the free distances, d, and the weights, w„, of the 
three convolutional codes we have studied. For BPSK with coherent 
detection we have the precise formula for a nonfading channel, 



Up) = i 1 



exp 

2np 



dx. (8) 



For independent Rayleigh fading signals in white Gaussian noise, we 
do not have a precise formula for /„,r(p). Instead we use the upper 
bound 14 



/ n , R (p) < 



exp(0/p) 



(9) 



Vl + 2/P + 1. ' 
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0= Vp 2 + 1 - l. 



(10) 



Figures 5 and 6 show estimates of binary error rates computed from 
(7) through (10) and the results of computer simulations of random 
data transmitted through a convolutional coder, a random channel, 
and a Viterbi decoder with path memory 30 bits. In Fig. 5 the 
simulations and calculations for the nonfading channel agree very 
closely. On the other hand Fig. 6 suggests that the bound in (9) is 
quite loose over the range of conditions of interest to us here; the 
estimated channel s/n corresponding to a given error rate is 3 to 4 dB 
lower in the simulations than in the computed curves. Figures 5 and 
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Fig. 5— Binary error rates of convolutional codes and uncoded Phase-Shift Keying 
(PSK) in a nonfading channel. The theoretical curves and the results of computer 
simulations are in close agreement. 
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Fig. 6— Binary error rates of convolutional codes and uncoded PSK in a Rayleigh- 
fading channel. The theoretical curves, based on upper bounds on path error probabili- 
ties, are loose over the range of error rates (10~ 4 to 10~ 2 ) of interest to us here. 



6 also show P, the binary error rates for uncoded transmission, which 
are P = f\&(p) in (8) for the nonfading channel and 



H 



i - 



i + p 



(ii) 



with Rayleigh fading. 15 

Note that the independent variable in Figs. 5 and 6 is the s/n of 
channel symbols rather the s/n of information bits, which is often 
plotted in comparisons of coding schemes for digital data transmission. 
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For our purpose the channel s/n is an appropriate measure because it 
remains constant as we change code formats. 

IV. PERFORMANCE 

To evaluate the potential effectiveness of variable-bit-rate trans- 
mission we have computed audio s/n as a function of channel s/n 
using the formulas in Table II and (7) through (10). Figure 7 applies 
to a nonfading Gaussian channel and Fig. 8 applies to independent 
Rayleigh fading signals (the s/n's of all channel symbols are mutually 
independent) in Gaussian noise. All curves pertain to a sign-magnitude 
representation of source symbols, which is generally more tolerant of 
transmission errors than the natural-binary representation. 

We assume that an audio s/n within 0.5 dB of the s/n of error-free 
16 kb/s transmission provides "adequate" voice quality; therefore, in 
Fig. 7 we estimate that relative to conventional 32 kb/s transmission, 
variable-bit-rate operation extends the range of useful channel con- 
ditions by 4.3 dB in a nonfading channel. (Without channel coding 
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Fig. 7 — Calculated performance of the four code formats as a function of the s/n of 
a nonfading channel. Convolutional coding extends the range of channel s/n's that offer 
adequate communication by 4.3 dB. 
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voice quality is inadequate when the channel s/n is less than 4.5 dB. 
With coding this threshold is 0.2 dB.) Figure 8 suggests that with 
Rayleigh fading the coding gain is 9.2 dB. 

The theoretical formulas, though simple, rely on many bounds, 
unverified assumptions, and idealizations of a practical communica- 
tions environment. To obtain more realistic estimates of performance 
we have resorted to computer simulations of adaptive DPCM trans- 
mission of a 2.5-second speech sample. The simulated Viterbi decoder 
has a path memory of 30 bits. Performance measurements, with speech 
quality measured as segmental s/n (see Section 2.4.2), are shown in 
Fig. 9 for a nonfading channel and in Fig. 2 for the Rayleigh-fading 
channel. 

We see that the approximations in Figs. 7 and 8 provide the same 
qualitative information as the speech simulations in Figs. 9 and 2. For 
the nonfading channel, simulations and theory lead to nearly equal 
estimates (4.3 dB and 4.7 dB) of coding gain. In the case of Rayleigh 
fading the theory underestimates this gain (by 3.8 dB) relative to 
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Fig. 9 — Performance of the four code formats in simulated speech transmission over 
a nonfading channel. Coding lowers by 4.7 dB the channel s/n required for adequate 
quality. 



simulation. This discrepancy is largely due to the loose upper bound 
on the binary error rate of the Viterbi decoder in a fading channel. 
(See Fig. 6 and Section 3.3.) 

All of the performance curves show that transmission Format 2 (24 
kb/s speech, all bits protected) is of very limited value. When the 
coded bits are essentially error free, Format 2 is only slightly better 
than Format 3 (24 kb/s, 2 of 3 bits/sample protected). In this condition, 
the transmission noise of Format 3 is due to errors in the third, 
uncoded, bit, which have a small effect on the quality of embedded 
DPCM because the errors are not enhanced by the receiver integrator. 
(In nonembedded DPCM Format 2 would be more effective than 
Format 3 over a wide range of channel conditions. 4 ) In a practical 
application, therefore, Format 2 would be omitted and the system 
would switch among Formats 1, 3, and 4. 

V. CONCLUSIONS: APPLICATIONS TO MOBILE RADIO 

We have described the key elements of a combined source/channel 
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codec and evaluated its ability to communicate speech over idealized 
channels. While this approach has other possible applications, we were 
motivated to study its potential for enhancing mobile radiotelephony. 
A detailed study of the codec in a mobile-radio environment is cur- 
rently under investigation. We conclude this paper by reviewing the 
features of the scheme and listing important issues to be addressed in 
assessing its value in a mobile-radio context. 

A principal advantage of the source codes and channel codes is their 
simplicity. The embedded DPCM coder and decoder can be realized 
on a single-chip microcomputer 16 and the Viterbi decoder, with only 
16 states, is within the state-of-the-art of special-purpose integrated 
circuits. 17 Because the channel signaling rate is constant, no special 
modem is required. Notwithstanding this simplicity, the speech quality 
for good channels is comparable to that of conventional telephony 
and the extension of the useful range of channel conditions (4.5 dB in 
a nonfading channel, 13 dB with independent Rayleigh fading) is 
substantial. 

To extend this idealized study to the context of variable-bit-rate 
communication over mobile-radio channels, we are led to study the: 

1. Control mechanism for altering code formats 

2. Channel characteristics, including the temporal nature of the 
Rayleigh fading and the geographical distribution of signal-to-inter- 
ference ratio in a mobile-radio service area 

3. Alternative modulation techniques to binary-phase-shift keying 

4. Possibility of diversity reception. 
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